Basic Statistics

Raw Counts

Name Value
Rows 29,851,450
Columns 19
Discrete columns 17
Continuous columns 2
All missing columns 0
Missing observations 65,086,922
Complete Rows 1,069,775
Total observations 567,177,550
Memory allocation 4.2 Gb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 2 columns ignored with more than 100 categories.
## res_county: 1037 categories
## county_fips_code: 1537 categories

QQ Plot

## Warning: Removed 1258 rows containing non-finite values (stat_qq).
## Warning: Removed 1258 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 2 features with more than 100 categories ignored!
## res_county: 535 categories
## county_fips_code: 725 categories
## Warning in cor(x = structure(list(case_positive_specimen_interval = c(2, : the standard
## deviation is zero

Principal Component Analysis

## 2 features with more than 100 categories ignored!
## res_county: 535 categories
## county_fips_code: 725 categories
## Warning in plot_prcomp(data = structure(list(case_month = c("2020-06", "2020-12", : The following features are dropped due to zero variance:
##  * symptom_status_Symptomatic